This is a quantized version of the KAT-Dev model of Kwaipilot, processed using the imatrix quantization technology of llama.cpp, aiming to improve the running efficiency and performance of the model in different hardware environments. This version offers multiple quantization levels, from high quality to extreme compression, to meet different memory and computing resource requirements.
Natural Language Processing
Gguf